WIP: The Coordinated Generation of Multimodal Presentations from a Common Representation
نویسندگان
چکیده
The task of the knowledge-based presentation system WIP is the generation of a variety of multimodal documents from an input consisting of a formal description of the communicative intent of a planned presentation. WIP generates illustrated texts that are customized for the intended audience and situation. We present the architecture of WIP and introduce as its major components the presentation planner, the layout manager, the text generator and the graphics generator. An extended notion of coherence for multimodal documents is introduced that can be used to constrain the presentation planning process. The paper focuses on the coordination of contents planning and layout that is necessary to produce a coherent illustrated text. In particular, we discuss layout revisions after contents planning and the influence of layout constraints on text generation. We show that in WIP the design of a multimodal document is viewed as a non-monotonic planning process that includes various revisions of preliminary results in order to achieve a coherent output with an optimal media mix.
منابع مشابه
Automatic Design of Multimodal Resentations
We describe our attempt to integrate multiple AI components such as planning, knowledge representation, natural language generation, and graphics generation into a functioning prototype called WIP that plans and coordinates multimodal presentations in which all material is generated by the system. WIP allows the generation of alternate presentations of the same content taking into account vario...
متن کاملDesigning Illustrated Texts: How Language Production Is Influenced By Graphics Generation
Multimodal interfaces combining, e.g., natural language and graphics take advantage of both the individual strength of each communication mode and the fact that several modes can be employed in parallel, e.g., in the text-picture combinations of illustrated documents. It is an important goal of this research not simply to merge the verbalization results of a natural language generator and the v...
متن کاملWIP: The Automatic Synthesis of Multimodal Presentations
Due to the growing complexity of information that has to be communicated by current AI systems, there comes an increasing need for building advanced intel l igent user interfaces that take advantage of a coordinated combination of different modalities, e.g., natural language, graphics, and animation, to produce situated and user-adaptive presentations. A deeper understanding of the basic princi...
متن کاملExplorations in a Natural Language Multimodal Information Access Environment
11 multimedia hypertext has become common, especially in musea, has as a consequence that it is among the rst ones to show its limits. Intelligent interfaces and especially the use of linguistic communication may ooer something concrete even at this early stage of our understanding of multimodal interaction. The coordinated generation of multimodal presentations from a common representation. 10...
متن کاملA Critical Visual Analysis of Gender Representation of ELT Materials from a Multimodal Perspective
This content analysis study, employing a multimodal perspective and critical visual analysis, set out to analyze gender representations in Top Notch series, one of the highly used ELT textbooks in Iran. For this purpose, six images were selected from these series and analyzed in terms of ‘representational’, ‘interactive’ and ‘compositional’ modes of meanings. The result indicated that there are...
متن کامل